CLAIMS 

What is claimed is: 

1. \ method of performing a discrete cosine transform (DCT) using a microprocessor 
having \n instruction set that includes SIMD floating point instructions, wherein the 
method comprises: 

receiving a block of integer data having C columns and R rows, wherein the block 
of mteger data is indicative of a portion of an image; and 

for each row,\ 

loading tnte row data into registers; 

converting thVrow data into floating point form, wherein the registers each 

hold two floating point row data values; and 
performing a plurality\of weighted-rotation operations on the values in the 

registers, whereinNhe weighted-rotation operations are performed 

using SIMD floating point instructions. 

2. The method of claim 1, wherein said convaling is accomplished using the pi2fw 
instruction. \ 

3. The method of claim 1, wherein said weighted-rotarbn operations are accomplished 
using the pswap, pfinul, and pfpnacc instructions. \ 

4. The method of claim 1, further comprising: \ 

for each row, 
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altering the arrangement of values in the registers; 

performing a second plurahty of weighted-rotation operations on the 

\ values in the registers; 
again altering the arrangement of the values in the registers; 
performing a third plurality of weighted-rotation operations on the values 
in tAe registers; 

yet again altering the arrangement of the values in the registers; and 
performing a fourth plurality of weighted-rotation operations on the values 
in the registers to obtain intermediate floating point values. 

5. The method of claim 4, further clijmprising: 

for each row, 

storing the intermediate ^floating point values to an intermediate buffer. 

6. The method of claim 5, further comprising: 

for two columns at a time, 

loading data from two colurJips of intermediate data into each of a 

plurality of registers; 
performing a plurality of weighted-Ajtation operations on the values in the 
registers, wherein the weimted-rotation operations for two 
columns are performed in parMlel using SIMD floating point 



mstructions. 
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7. ^H^^ethod of claim 6, wherein said weighted-rotation operations for two columns at a 
time aresaccomplished using pfmul, pfsub, and pfadd instructions. 

8. The method of claim 6, further comprising: 

for twoVolumns at a time, 

as>^ach weighted-rotation operation is done, storing weighted-rotation 
.operation results to the intermediate buffer. 

9. The method of claim V further comprising: 

for two columns at \ time, 

retrieving weigljted-rotation operation results from the intermediate buffer; 
performing a sei^pnd plurality of weighted-rotation operations on the 
retrieved vames; 

again storing weightedVotation operation results to the intermediate buffer 
as the weighted-rotation operations of the second plurality are 
done; 

again retrieving weighted-rotati\n operation results from the intermediate 
buffer; 

performing a third plurality of w^hted-rotation operations on the 
retrieved values; 

yet again storing weighted-rotation operatioV results to the intermediate 
buffer as the weighted-rotation operation^of the third plurality are 
done; 
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ye\ again retrieving weighted-rotation operation results from the 

intermediate buffer; 
performh^ a fourth plurality of weighted-rotation operations on the 
retriteved values; 

5 converting thaweighted-rotation operation results from the fourth plurality 

to integeXresults. 

10. The method of claim 9, fiirtheXcomprising: 

for two columns at a time, writing the integer results to an output buffer. 

1 1 . A method of performing a discrete cosine transform (DCT) using a microprocessor 
10 having an instruction set that includes ^IMD floating point instructions, wherein the 

method comprises: 

receiving a block of integer data having C columns and R rows; and 
for two columns at a time, 

loading column data into registers; 
15 converting the column data into floating point form, wherein the registers 

each hold a floating point colum^ data value from two columns; 
and 

performing a plurality of weighted-rotation operations on the values in the 
registers, wherein the weighted-rotatron operations for two 
20 columns are performed in parallel using\SIMD floating point 

instructions. 
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12. Trae method of claim 11, wherein said weighted-rotation operations are accomplished 
using pnnul, pfsub, and pfadd instructions. 

13. The merhod of claim 11, further comprising: 

for two columns at a time, 

as each weighted-rotation operation is done, storing weighted-rotation 
aeration results to an intermediate buffer. 

14. The method of claim 13u further comprising: 

for two columns at a time, 

retrieving weighted-rotation operation results from the intermediate buffer; 
performing a second plurality of weighted-rotation operations on the 
retrieved values\ 

again storing weighted-rottition operation results to the intermediate buffer 
as the weighted-rotaision operations of the second plurality are 
done; \ 

again retrieving weighted-rotation (smeration results from the intermediate 
buffer; \ 

performing a third plurality of weigmed-rotation operations on the 
retrieved values; \ 

yet again storing weighted-rotation operation results to the intermediate 
buffer as the weighted-rotation operations Xf the third plurality are 
done; \ 
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^et again retrieving weighted-rotation operation results from the 

intermediate buffer; 
perfo^ing a fourth plurality of weighted-rotation operations on the 
tetrieved values; 

converting the weighted-rotation operation results from the fourth plurality 
to inWer results. 
15. The method of claim 14, mrther comprising: 

for two columns at a time, writing the integer results to an output buffer. 



10 16. A computer system comprising: 

a processor having an insti\jction set that includes SIMD floating point 
instructions; and 

a memory coupled to the processor, wherein the memory stores software 
instructions executable by th^ processor to implement the method of 
15 receiving a block of integer dataNhaving C columns and R rows, wherein 

the block of integer data is indicativ\of a portion of an image; and 
for each row, 

loading the row data into registers; 
converting the row data into floating point foW, wherein the registers each 
20 hold two floating point row data values;\nd 
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performing a plurality of weighted-rotation operations on the values in the 
registers, wherein the weighted-rotation operations are performed 
ising SIMD floating point instructions. 



5 17. A carrier medium comprmng software instructions executable by a microprocessor 
having an instruction set that incVdes SIMD floating point instructions to implement a 
method of performing discrete cosineVansform (DCT), wherein the method comprises: 

receiving a block of integer data ftaving C columns and R rows, wherein the block 
of integer data is indicative o\a portion of an image; and 
10 for each row, 

loading the row data into registers; 
converting the row data into floating pdmt form, wherein the registers each 

hold two floating point row data vmies; and 
performing a plurality of weighted-rotation operations on the values in the 
15 registers, wherein the weighted-rotatio\ operations are performed 

using SIMD floating point instructions. 
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